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SYSTEM AND METHOD FOR MONITORING AND DIAGNOSIS OF 
VIDEO NETWORK PERFORMANCE 



TECHNICAL FIELD OF THE INVENTION 

This invention relates generally to audio visual 
network communications, and more specifically relates to 
a system and method for monitoring and diagnosis of video 
network and device performance in transferring audio 
visual data over a video network. 
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BACKGRO UND OF THE INVENTION 

Video networks communicate audio visual data to 
allow participants in a video call to see and hear each 
other even over extended distances. Typically, video 
endpoints collect video images and audio signals from 
local participants, digitize the video images and audio 
signals, and compress the video images and audio signals 
for transfer over a network to a remote video endpoint . 
The digitized audio visual data is typically transferred 
over a number of different types of networks and with a 
number of different types of protocols and then converted 
to images and sounds at the remote location by 
decompressing the data and converting the data into video 
images and audio signals for presentation at the remote 
site . 

Digitization, compression and decompression 
functions for transfer of audio and visual data are 
typically performed with specialized hardware known as 
CODECs, an abbreviation for compression\decompression. 
Compressed audio visual data may be sent through networks 
with equipment having a variety of degrees of 
specialization. For instance, a simple video 
teleconference might involve a direct dial up between two 
video endpoints, such as with an ISDN or T-l connection. 
More complex video calls often involve specialized 
equipment such as multicall units (MCU) that coordinate 
conferences between three or more endpoints and such as 
gateway devices that coordinate communication between 
devices using different protocols. Such specialized 
devices are deployed in video networks to allow access by 
video endpoints as needed when a video call is configured 
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and initiated across the video network. Alternatively, 
in TCP\IP networks that support video and voice over 
Internet Protocol, the functions of such specialized 
equipment are typically performed by software modules 
loaded on servers of the video network. 

When a video call is established between video 
endpoints, the audio visual data is typically not 
accessible at a packet- level or CODEC-level as the audio 
visual data is routed between endpoints and through 
various video devices. Generally, video devices are 
designed to pass through audio visual data without 
extensive processing in order to avoid delays in the 
video conference presentation. For instance, the H.323 
video conference protocol enables services such as call 
forwarding but avoids exposing audio visual data to a 
packet level or CODEC level as the data passes through 
the video network nodes. Although compliance with the 
H.323 protocol aids in establishing and maintaining high 
quality videoconf erences , when quality problems do arise 
the inaccessibility of audio visual data as the data 
transfers through the video network makes the diagnosis 
of the problems difficult. 
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SUMMARY OF THE INVENTION 

Therefore a need has arisen for a system and method 
which provides monitoring and diagnosis of video device 
performance in the transferring of audio visual data over 
5 a video network. 

A further need has arisen for a system and method 
which provides real time monitoring and diagnosis of 
video device performance without introducing delays to 
audio visual data transfers. 
10 In accordance with the present invention, a system 

y± and method are provided which substantially reduces the 

5? problems and disadvantages of diagnosing video device 

("less? 

Hi performance in the transfer of audio visual data over a 

yL video network. A diagnostic tool receives audio visual 

^ 15 data associated with a video device and accesses the 
in audio visual data to a predetermined level needed for 

diagnosis of the performance of the video device. 
W More specifically, a diagnostic node interfaces with 

m a video network to receive audio visual data associated 

2 0 with one or more video devices of the video network. For 
instance, the diagnostic node operates as a transparent 
passthrough/proxy for a video device or as a regular 
terminating route in order to evaluate audio visual data 
and determine performance statistics associated with a 

2 5 predetermined video device that is proximate the 

diagnostic node. The diagnostic node includes a physical 
network interface with the video network and is 
deployable as a headless, dedicated, client-enabled 
device which hosts diagnostic tools operable to access 

3 0 audio visual data associated with a video device as the 

audio visual data passes through the network. 
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The diagnostic node includes diagnostic tools for 
accessing the audio visual data, such as an audio CODEC , 
a video CODEC and a packet sniffer. A diagnostic engine 
coordinates the evaluation of audio visual data by the 
diagnostic tools. For instance, the audio and the video 
CODECS expose the audio visual data for analysis of a 
broad range of statistics such as jitter, latency, 
throughput, packet loss and lip-synch, with analysis of 
audio data alone, video data alone or combined data. 
Similarly, the packet sniffer exposes audio visual data 
in TCP/IP packets for analysis of such statistics. 
Performance statistics are then forwarded through a 
communication agent, such as a web server or SNMP agent 
of the diagnostic node, through the video network for 
monitoring and analyzing the performance of the video 
device associated with the evaluated audio visual data. 

The present invention provides a number of important 
technical advantages. One important technical advantage 
is that a diagnostic node determines performance 
statistics for one or more video devices with a dedicated 
and compact device having a simple physical interface to 
the video network. Thus, a dedicated device easily 
interfaces with the video network to allow a packet or 
CODEC- level evaluation of audio visual data and provide 
performance statistics otherwise unavailable due to the 
inaccessibility of the audio visual data by video 
devices. The dedicated device interfaces as a stand 
alone device able to forward the performance statistics 
for the desired video device through the video network. 

Another important technical advantage of the present 
invention is that performance statistics for a 
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predetermined video device are provided without 
introducing any substantial delays in the transfer of 
audio visual data across the video network. For 
instance, a diagnostic node is deployed as an H.323 
endpoint or as an H.323 passthrough\proxy to provide 
diagnosis of video device performance in a transparent 
manner. Control of the diagnosis and communication of 
the performance statistic results may be provided through 
a communication agent, such as a web server or SNMP 
agent, that allows analysis of a network through 
distribution of diagnostic nodes at distal points of a 
video network. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

A more complete understanding of the present 
embodiments and advantages thereof may be acquired by 
referring to the following description taken in 
conjunction with the accompanying drawings, in which like 
reference numbers indicate like features, and wherein: 

FIGURE 1 depicts a block diagram of a diagnostic 
node deployed in a video network; and 

FIGURE 2 depicts a block diagram of a diagnostic 

node . 
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DETAILED DESCRIPTION OF THE INVENTION 

Preferred embodiments of the present invention are 
illustrated in the FIGURES, like numerals being used to 
refer to like and corresponding parts of the various 
drawings . 

Video devices pass large quantities of audio visual 
data through video networks in order to support a video 
call. The audio visual data is compressed and 
decompressed by video endpoints to more effectively use 
video call bandwidth. However, the transmission of 
compressed audio visual data makes the diagnosis of 
problems with a video device difficult to accomplish. 
For instance, many deployed CODECS in H.32 3 protocol 
compliant devices do not expose audio visual data to 
allow analysis of video device performance. Thus, to 
detect and diagnose video network problems, packet -level 
and CODEC- level exposure of data from nodes of the video 
network is often needed. 

The present invention provides detailed level 
analysis of audio visual data through the distribution of 
one or more diagnostic nodes as dedicated devices 
interfaced with the video network. The diagnostic node 
allows analysis of audio visual data at a network layer 
to provide the performance statistics that allows real 
time monitoring and diagnosis of video devices. 
Referring now to FIGURE 1, a block diagram depicts 
diagnostic nodes 10 deployed in a video network 12 
between video endpoints 14 and another video device 16, 
such as a video endpoint, MCU or gateway. Diagnostic 
node 10 analyzes audio visual data passed through video 
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network 12 to determine performance statistics such as 
jitter, latency, throughput, packet loss, lip-synch, etc. 

Diagnostic node 10 is deployed as an H.323 protocol 
compliant terminating route or an H.323 protocol 
compliant pass through proxy so that audio visual data 
passed through network 12 is available to the diagnostic 
node 10 without interfering with any on-going video 
calls. Diagnostic node 10 interfaces with video network 
12 proximate to the video endpoint 14 or video device 16 
being diagnosed and provides the results of the analysis 
of audio visual data associated with the video device 
through the video network, such as to a video network 
platform 18. For instance, the video network may be a 
local area network so that diagnostic node 10 is deployed 
as a compact, headless, dedicated client -enabled device 
distributed through the video network and interfaced with 
an Ethernet physical network interface. Diagnostic node 
10 thus runs on a very thin hardware machine, such as a 
400 MHz Pentium 3 with 32 megabytes of RAM and a 2 
gigabyte hard drive that supports the network interface 
and hosts the diagnostic tools and engines. 

The distribution of diagnostic nodes 10 through a 
video network permits a detailed analysis of audio visual 
data otherwise unavailable through network layer video 
device communications. Communication over the video 
network 12 with a video network platform 18 provides 
remote control and tracking of performance statistics 
generated by diagnostic node 10 so that the diagnostic 
node is simply a plug-in network device. Further, when 
deployed as a passthrough\proxy as indicated by dotted 
line 20, the diagnostic node 10 introduces minimal delay 
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while obtaining full access to communications with a 
desired video device. In an embodiment deployed in a 
video over Internet Protocol network, the diagnostic node 
may also be a software module running on a server. 

Deployed in the configuration illustrated by FIGURE 
1, the diagnostic nodes 10 isolate a difficulty in the 
transmission of audio visual data by determining which 
video network device is introducing errors to the data. 
For instance, if a lip sync problem exists at video 
endpoint 14B, then diagnostic nodes 10A and 10B determine 
the point in the network at which errors are introduced 
by measuring the lip sync quality. As audio visual data 
is transmitted from video endpoint 14A through the video 
network 12 to video endpoint 14B, diagnostic node 10A 
determines lip sync quality associated with video device 
16A and diagnostic node 10B determines lip sync quality 
associated with video device 16B. If no lip sync error 
is associated with video device 16A but a determined 
level of lip sync error is associated with video device 
16B, then the lip sync problem is isolated to video 
device 16B. Similarly, if a lip sync error of a 
determined level is associated with video device 16A and 
the level of the error remains unchanged at video device 
16B, then the lip sync problem is isolated to video 
device 16A. By measuring changes in quality of lip sync 
as audio visual data is transmitted through the video 
network, diagnostic nodes 10 isolate problems in more 
than one video device, such as by comparing the error 
measurements sent to video network platform 18. 

Referring now to FIGURE 2, a block diagram depicts 
one embodiment of a diagnostic node 10 interfaced through 
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a network 12 to diagnose an associated video device 16, 
such as a video endpoint, MCU or gateway. Diagnostic 
node 10 communicates through a network interface 22 that 
physically connects diagnostic node 10 with network 12, 
such as an Ethernet interface to a local area network. 
Network interface 22 is a client -enabled device, such as 
with DHCP to support an Ethernet physical interface with 
a local area network that runs on such conventional 
operating systems as Linux or Windows. Communication 
agents such as a web server 24 or SNMP agent 2 6 provide 
access to diagnostic node 10 from clients interfaced 
through network 12 and network interface 22 to support 
control of video device monitoring and diagnosis as well 
as to communicate performance statistics. For instance, 
a browser on VNP 18 allows a control and reporting module 
3 0 to communicate with diagnostic node 10 through web 
server 24. Alternatively, VNP 18 communicates directly 
with diagnostic node 10 using SNMP and SNMP agent 26. 

Diagnostic engine 32 in diagnostic node 10 
communicates with diagnostic tools, such as audio CODEC 
34, video CODEC 3 6 and packet sniffer 38, to monitor 
audio visual data routed through diagnostic node 10 and 
to determine performance statistics. Audio CODEC 34 
includes one or more instrumented audio CODECS, such as 
audio CODECS compliant with G.711, G.722, G.723, G.728, 
and G.729. Video CODEC 36 includes one or more 
instrumented video CODECS such as video CODECS compliant 
with H.261 and H.263. Packet sniffer 38 provides 
analysis of TCP/IP packets such as those that route audio 
visual data as video over Internet Protocol. A video 
device proxy 4 0 provides compliance with an appropriate 
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protocol, such as compliance with the call signaling and 
control of H.323 protocol. Thus, video device proxy 40 
supports deployment of diagnostic node 10 as an endpoint 
or a transparent H.323 passthrough\proxy . In this 
manner, audio visual data associated with video device 16 
is routed to diagnostic node 10 and video device proxy 40 
may determine appropriate audio visual data for analysis. 

The instrumented CODECS 34 and 3 6 and the packet 
sniffer 3 8 allow diagnostic node 10 to have a detailed 
exposure to audio visual data associated with the 
monitored video device 16. Thus, diagnostic engine 32 
uses this CODEC-level and packet-level exposure to 
interpret audio visual data at the desired level of 
detail and get audio and video metrics for detailed 
analysis of the performance of video device 16. For 
instance, diagnostic engine 32 accepts audio visual data 
from an MCU video device and, essentially, plays the 
video call using the CODECs 34 and 36 to detect, as an 
example, lip-synch problems. Thus, if a difficulty 
arises during a video call, the deployment of plural 
diagnostic nodes 10 across the video network aids in 
pinpointing the video device or video devices introducing 
the difficulty. As a further example, the audio and 
visual codecs allow separate analysis of audio jitter and 
video jitter for a give video device. 

Although the present invention has been described in 
detail, it should be understood that various changes, 
substitutions and alterations can be made hereto without 
departing from the spirit and scope of the invention as 
defined by the appending claims. 
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